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^^ ', Abstract 
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1 Introduction 

In this paper I discuss some of the more important known results on equilateral sets in finite-dimensional 
normed spaces with an analytical flavour. I have omitted a discussion of some of the low-dimensional 
results of a discrete geometric nature. The omission of a topic has no bearing on its importance, only on 
my prejudices and ignorance. The following topics are outside the scope of this paper: almost-equilateral 
sets, fc-distance sets, and equilateral sets in infinite dimensional spaces. However, see the last section for 
references to these topics. 

I have included the proofs that I find fascinating. The arguments are mostly very geometrical, using 
results from convex geometry and the local theory of Banach spaces, and also from algebraic topology. In 
the case of the £p-norm the known results use tools from linear algebra, probability theory, combinatorics 
and approximation theory. 

Equilateral sets have been applied in differential geometry to find minimal surfaces in finite-dimen- 
sional normed spaces [^ |2H] . 

Throughout the paper I state problems. Perhaps some of them are easy, but I do not know the answer 
to any of them. 

1.1 Definitions 

A subset S* of a metric space {X, d) is X-equilateral (A > 0) if d{x, y) = X for all x,y (£ S,x ^ y. Let 
eA(^) denote the largest size of a A-equilateral set in X, if it exists, otherwise set eA(^) = oo. If X is 
compact then it is easily seen that eA(^) < oo for each A > 0. 

In a normed space X, if S is A-equilateral, then jS is ^-equilateral for any /i > 0; thus the specific 
value of A does not matter, and we only write e{X). If X is finite-dimensional, then e(X) < oo. In 
the remainder of the paper we abbreviate finite-dimensional normed space to Minkowski space. The 
Banach-Mazur distance between two Banach spaces X and Y is the infimum of all c > 1 such that there 
exists a linear isomorphism T : X ^ Y satisfying ||T||||r^^|| < c. In the finite-dimensional case, this 
infimum is of course attained. We always use n to denote the dimension of X, and m the size of a set. We 
use K for the field of real numbers, K" for the n-dimensional real vector space of n-dimensional column 
vectors, and e^ for the ith standard basis vector which has 1 in position i and in all other positions. 
The transpose of a matrix A is denoted by A^'^. For a set S* C M", conv(S'),int(S'), vol(S') denotes its 
convex hull, interior, and Lebesgue measure (if S is measurable), respectively. By log a; we denote the 
natural logarithm, and by Iga; the logarithm with base 2. 

1.2 Kusner's questions for the ip norms 

On M" the ip norm is defined by 

\\{xi,X2,---,Xn)\\p := I ^l^^iT ) 

if 1 < p < oo , and for p = oo by 

\\{xi,X2,-- .,a;„)||oo := maxdxil : z = 1,. ..,n}. 

We denote the Minkowski space (R",||-||p) by £". The £p distance between two points a.b E M" is 
II a — b||p. Thus Euclidean space is £2- 

The following lower bounds for e(^") are simple: 

• Since {±ej : z = 1, . . . , n} is 2-equilateral in £", we have e(^") > 2n. 

• For any p G (1, 00) and an appropriate choice of A = X{p) G R, the set 



{61,62,. . . ,e„, A^JeJ 



is 2^/P-equilateral in dp] hence e{lp) >n + \. 
• {{ei, ...,£„):£, = ±1} is 2-equilateral in ^^; thus e{e^) > 2". 

A theorem of Petty [221 (see Section [n|l states that e{X) < 2" if the dimension of X is n; this then 
gives e{i2a) = 2". Kusner [201 asked whether e(£") = 2n and e(£p) = n + 1 holds for all 1 < p < oo. 
For p an even integer it is easy to show an upper bound of about pn using linear algebra (observation of 
Galvin; see Smyth [33]). It is possible to improve this upper bound to about pn/2, and in the particular 
case of p = 4 to show e{£2) = n + 1 [23 (see Section [OJ. For 1 < p < 2 there are examples known 
showing e{£p) > n+ 1 if n is sufficiently large (depending on p) [2Zj (see Section 13?^ . On the other hand 
if p is sufficiently near 2 (depending on n), then e{£p) = n+ 1. Smyth [X^\ gives a quantitative estimate 
(see Section ^21. 

The above results are not difficult; to improve the 2" upper bound for e(^") turned out to need deeper 
results from analysis and linear algebra. The first breakthrough came when Smyth |33| combined a linear 
algebra method with the Jackson theorems from approximation theory to prove e(^") < Cpn^P~^^'> ^ '-p~^^ 
for 1 < p < cxD. Then Alon and Pudlak ^ combined Smyth's method with a result on the rank of 
approximations of the identity matrix (the rank lemma; see Section lH!^ to prove e(^") < Cpn^^P~^^'''-'^P~^'^ 
for 1 < p < oo. For the £i norm Alon and Pudlak combined the rank lemma with a probabilistic 
argument (randomized rounding) to prove e(^") < en log n, and more generally e(£p) < Cpnlogn if p is 
an odd integer. 

Finally we mention that it is known that e(^f) = 6 (Bandelt et al. 3 ) and e{£f) = 8 (Koolen et al. 
[2Si)- We do not discuss the proofs of these two results. 

Problem 1 (Kusner). Prove (or disprove) that e{£i) = 2n for n > 5. 

Problem 2 (Kusner). Prove (or disprove) that e(^") = n+1 for p > 2 (p j^ 4). 

Problem 3. Prove (or disprove) that e(ff ) < en for 1 < p < 2. 

2 General upper bounds 

We now need the following notions from convex geometry. A polytope is a convex body that is the convex 
hull of finitely many points. A face of a convex body C is a set F Q C satisfying the following property: 

If a segment ab C C intersects S, then ab C S. 

The following are well-known (and easily proved) facts from convex geometry: 

1. Every point of a convex body C is contained in the relative interior of a unique face of C. 

2. The faces of a polytope forms a ranked finite lattice, with dimension as rank function. 

3. The 1-dimensional faces are called vertices, and the {n — l)-dimensional faces facets. 

4. Any {n ~ 2)-dimensional face is contained in exactly two facets. 

A parallelotope is a translate of the polytope {X]i=i ^iO'i ■ ~1 < •^ < 1} where ai, . . . , a„ are linearly 
independent. It is easily seen that if a Minkowski space has a parallelotope as unit ball, then it is isometric 

2.1 The theorem of Petty and Soltan 

The following theorem was proved by Petty [23] , and later also by P. Soltan "31" . A different proof, using 
the isodiametric inequality, is given in |15| . 



Theorem 1 (Petty [29_ &; Soltan f54j). For any n-dimensional normed space X, e{X) < 2" with 
equality iff X is isometric to P^, in which case any equilateral set of size 2" is the vertex set of some hall 
(which is then a parallelotope) . 

Proof. Let pi, . . . .pm G X be 1-equilateral, where m — e(X). Let P — conv{pi, . . . ,Pm} be the convex 
huU of the points Pi. If P is not full-dimensional we may use induction on n to obtain m < 2"^^ < 2" 
(and the cases n = and n = \ are trivial). 

Thus without loss of generality P is full-dimensional. For each i = 1, . . . ,r7i, let P^ = i(P + pi). Note 
the following easily proved facts: 

1. Each Pi C P. 

2. Each Pi C B{pi,\). 

3. For any i ^ j, Pif] Pj has empty interior. 
Thus the Pi pack P, and we now calculate volumes: 



m 
^VO. 

1=1 



l(P,)=vol(|JP,)<vol(P). 



Since vol(P,) = (5)" vol(P), we obtain to < 2". 

If equality holds, P is a polytope in M" that is perfectly packed or tiled by 2" translates of ^P. 
By the following lemma P must be a parallelotope v + {J2"=i -^i^i '■ ^^ 1^ ^ ^ ^} with vertex set 
{pi, . . . ,Pm} — V + {X)"=i ^i^i ■ ^i = il}- Since Pi — Pj is a unit vector for i ^ j, it follows that 
^"^j^ 2eiai is a unit vector for all choices of Si G {—1, 0, 1} with not all Si ~ 0. It is then easy to see 
that the unit ball is the parallelotope {Y^^=i \0'i : ^ 1 < A < 1} (exercise), and the linear map sending 
ai 1-^ Bi is an isometry. D 

Lemma 1 (Groemer [17 ). Let P he the convex hull ofvi, . . . , V2" G K". Suppose that P — [Ji^i ^{P + 
Vi). Then P is a parallelotope with vertices Ui, . . . , V2" ■ 

Proof. Exercise. D 

2.2 Strictly convex norms 

A normed space is strictly convex if||cc + yj| < j|a;|| + ||y|| for all linearly independent x,y, or equivalently, 
if the boundary of the uniy ball does not contain a line segment. If X is a strictly convex Minkowski 
space, then e{X) < 2" — 1 by Theorem^ but as far as I know there is no better upper bound for n> A. 

Problem 4. If X is an n-dimensional strictly convex Minkowski space, n > A, prove that e{X) < 2" — 2. 

In Section [3.11 we'll see that e{X) = 3 when dimX — 2, and for dimX = 3, Petty |5^ deduced 
e{X) < 5 from a result of Griinbaum |19j . 

Conjecture 1 (Fiiredi, Lagarias, Morgan |15j ). There exists some constants > such that if X is 
a strictly convex n-dimensional Minkowski space then e{X) < (2 — e)". 

There exist strictly convex spaces with equilateral sets of size at least exponential in the dimension. 

Theorem 2 (Fiiredi, Lagarias, Morgan |15p . For each n > 271 there exists an n-dimensional strictly 
convex X with an equilateral set of size at least 1.058". 

In 15, the lower bound 1.02" was obtained; however, their argument easily gives the bound as stated 
here. They constructed a norm ||-|| on R" with the property that ||-|| — e||-||2 is still a norm, and asked 
whether the norm can in addition be C°° on M" \ {0}. In our construction the norm will lack the above 



property, but it will be C°°. However, most likely the norm in the above theorem can be made to have 
the above property and be C°°. 

We need the following very nice high-dimensional phenomenon. It is a special case of the Johnson- 
Lindenstrauss flattening lemma |55j , although this special case is essentially the Gilbert- Varshamov lower 
bound for binary codes (see |25|.) 

Lemma 2. For each (5 > there exist e ~ e{5) > and uq — no{S) > 1 such that for all n > hq there 
exist m > (1 + e)" vectors Wi, . . . , Wm G K" satisfying 

\{wi,Wi) = 1 for all i, 

1 \{wi,Wj)\ < S for all distinct i,j. 

We may take e = 5^/2 and no > (1201og2)/(255'^ - S^). 

We need the Chernoff inequality (proved in e.g. |25l Theorem 1.4.5]). The binary entropy function is 
defined by H{0) = H{1) := and 

H{x) := -cclga; - (1 - a;)lg(l - x), for < a; < 1. 

Lemma 3. For any e > and n G N, 

0<fc<en ^ ^ 

Proof of Lemma\^ For n > 1, define a graph on { — 1,1}" as vertex set by joining two points if the 
number of coordinates in which they differ is < ^(1 — S)n or > ^(1 + S)n. This graph is regular of degree 

0<fe<i(l-5)n ^ ^ i(l+(5)Ti<A;<n ^ ^ 

by ^. If X and y are not connected in this graph, then n^^''^x and n^^/^y are unit vectors satisfying 
|(a;,y)| < S. We therefore have to show that the graph contains an independent set (a set without an 
edge between any two vertices) of at least (1 -I- e)" points. We arbitrarily choose a vertex, delete it and 
its m neighbours, choose a remaining vertex, delete it and its at most m neighbours, and continue until 
nothing remains. Thus we obtain an independent set of size at least 

2"/(m+l) <2"(i-^«i-^)/2))-i 

vertices, which is greater than (1 + (5^/2)" if 

nf{S) > 1, 

where f{S) = 1 — H{{1 — S)/2) — lg(l + S'^/2). But this is true for sufficiently large n, since it is easily 
seen that f{d) > for < (5 < 1. That n > (1201og2)/(25(5'' — S^) is sufficient can be seen by calculating 
the Taylor expansion oi f{6). D 

Remarks 

1. The proof shows that if M{n, S) is the maximum number of unit vectors in R" satisfying Q, then 



limilgM(n,^)>l-i7(i-^)=-I-(^ + ^ 
n^oon ^ ^ ' ' - \ 2 J log2 V 2 12 



If one repeatedly choose unit vectors on the Euchdean unit sphere §"~^ and delete spherical caps 
around them, then, as shown by Wyner |40i pp. 1089-1092] (slightly improving an earlier estimate 
of Shannon |3^ ) , 



iigM(„..)>,g,i-o-"^^;^(^4+..o. 



lim 

which is the same up to first order, but has a better higher order term. Thus the above greedy 
argument is not really improved by choosing vectors from the Euclidean unit sphere. 

2. Recently, the Gilbert- Varshamov lower bound for binary codes has been slightly improved |21| . 
which will improve the lower bound in Lemma[21to c{S)n{l + 5^/2)". 

3. If we prove this lemma by choosing vectors in {±1} randomly, and estimating the probabilities in 
the obvious way, then we obtain a slightly worse dependence of e = (5^/4. 

Proof of Theorem\^ By Lemma |21 there exists Wi, . . . , Wm, rn > 1.058", satisfying Q with 5 = 1/3. 

We now show that P ~ cony{wi — Wj '■ 1 < i, j < m} is a centrally symmetric polytope with in{m — 1) 
vertices Wi — Wj, i j^ j. It is sufficient to note that the hyperplane 

{x E M" : {x, Wi — Wj) = {wi — Wj,Wi — Wj)} 

is a supporting hyperplane of P passing only through Wi — Wj . This follows from 

{Wj - Wi, Wi - Wj) , ± {W.i - Wk, W^ -Wj) , 

± {wj - Wk,w.i ~ Wj) , ± {wk - We, w^ - Wj) 
< {wi — Wj , Wi — Wj) for distinct i,j,k,£, 

which follow easily from (^ and 6 — 1/3. 

With P as a unit ball we thus already have {wi : i — 1, . . . , m} as an equilateral set. We now need to 
find a strictly convex centrally symmetric body with the vertices of P on its boundary. This is provided 
by the next lemma. D 

The following lemma is obvious, but we provide a simple proof. 

Lemma 4. Let S be the vertex set of a centrally symmetric polytope in M". Then there is a smooth, 
strictly convex norm \\-\\ on M" such that \\x\\ — 1 for all x £ S. 

Proof. Let 5* = {±xi, . . . ,±x„i}. For each Xi, choose yi £ R" such that {yi,Xi) — 1 and \{yi,Xj)\ < 1 
for all i =/= j. The required norm will be 

11^11 = (E^:'Ky^'^)i') 

for suitably chosen Xj > and 1 < p < oo, i.e., we want to imbed S into the unit sphere of £™ for some 

P- 

Choose p large enough such that 

\{yj,Xi)\P < — for aU j ^ i. 



Consider the matrix A = [oy] . . with 



\{yj,x.,)\P iii^j, 
iii= j. 



Considering A as a linear transformation on ^™, we have \\A\\ < i. We want to find a vector v = 
(Ai, . . . , Am)*'' > such that 

since then we would have \\xi\\ — 1 for all i. However, since ||A|| < ^, I + A is invertible, and then 
necessarily v = {I + A)~^{1, . . . , 1)*''. Also, 

lit; - (1, ... , irnu = \\{i+ Ar\i, . . . , ir - (1, ... , irioo 

<ii(/+^ri-/iiii(i,...,i)iioo 

= ii£A^ii<£pir^<i. 

i=l 1=1 

Thus, A, > for all i. D 

Note that in the above lemma we may choose p to be an even integer which gives a norm that is C°° 
on R" \ {o}. 

3 Simple lower bounds 

We now consider the problem of finding equilateral sets in a general Minkowski space, thus providing a 
lower bound to e{X). 

3.1 General Minkowski spaces 

Proposition 1. If dim X > 2 then e{X) > 3. 

Proof. Exercise. (Hint: Euclid Book I Proposition 1.) D 

The above proposition combined with Theorem ^ gives the following 
Corollary 1. If dim X = 2 then 

I 4 if the unit ball of X is a parallelogram, 

I 3 if the unit ball of X is not a parallelogram. 

In general one would hope for the following conjecture, stated in [181 1281 I^I39| . 

Conjecture 2. If dim. X = n then e{X) > n + 1. 

As seen above this is simple for n — 2, and the next theorem shows its truth for n = 3. However, 
for each n > 4 this is open, and the best that is known is the theorem of Brass and Dekster giving 
e{X) > c(logn)^/'^, discussed in Section|5| 

Theorem 3 (Petty [29 ). // dimX > 3 then any equilateral set of size 3 may be extended to an 
equilateral set of size 4. 

The proof needs the following technical but simple lemma. 

Lemma 5. Let S2 be the unit circle (i.e. boundary of the unit ball) of a 2 -dimensional Minkowski space. 
Fix any u e 5*2- Let f : [0, 1] -^ S2 be a parametrization of an arc of S2 from u to —u. Fix p = Xu, 
A > 0. Then t t— > ||p— /(Oil ^^ strictly increasing before it reaches the value ||p + tt||, i.e. on the interval 
[0,to], where to ^min{t:\\p-f{t)\\ <||p + m||}. 

Proof. Exercise. D 



Proof of Theorem\^ Let a,b,c be 1-equilateral in X. Without loss of generality a = o. Let X2 = 
span{6, c}, and let X3 be any 3-diniensional subspace of X containing X2. Let S2 be the unit circle of 
X2, and let 53 be the unit sphere of X3. 

Consider the mapping / : S'3 — > M^ defined by f{x) — {\\x — b\\, \\x — c\\). We have to show that 
(1,1) is in the range of /. If we can show that the restriction of / to S2 encircles (1,1), then the theorem 
follows from the following topological argument. 

5*2 can be contracted to a point on 5*3, since 5*3 is simply connected; thus f{S2) can also be contracted 
to a point; if (1, 1) is in the interior of the Jordan curve f{S2), then at some stage of the contraction the 
curve must pass through (1, 1), since R^ \ {(1, 1)} is not simply connected. 

Thus we assume without loss of generality that (1, 1) ^ /(S'2), and we now follow the curve f{S2)- 
We start at /(6) — (0, 1) and go to /(c) = (1, 0). By Lemma the x-coordinate increases strictly while 
the y-coordinate decreases strictly. Therefore the arc of the curve strictly between (0,1) and (1,0) is 
contained in the square < a;,y < 1. Then from (1,0) to /(c — b) = (||c — 26||, 1) the y-coordinate 
increases strictly, and the x-coordinate also increases strictly at least in a neighbourhood of (1, 0) if 
II c — 26|| > 1. However, by the triangle inequality we have ||c — 26|| > 1, and if ||c — 2b|| = 1 then we 
already have found (1,1) to be in the range of /, which we assumed not to happen. 

From /(c — b) to f{—b) = (2,||6 + c||) the x-coordinate increases, and from /(— b) to f{~c) — 
(II 6 + c||, 2) the y-coordinate increases. From /(— c) to /(b— c) = (1, ||6 — 2c||) the x-coordinate decreases 
strictly, and the y-coordinate decreases. Again by the triangle inequality and assumption we have ||b — 
2c|| > 1. Finally, from f{b — 2c) to f{b) the x-coordinate decreases strictly, and the y-coordinate 
decreases. 

It follows that f{S2) encircles (1, 1), and the theorem is proved. D 

Note that if the fourth point which extends the equilateral set is in the plane X2 (i.e. if (1, 1) e /(S'2) 
in the above proof), then X2 is isometric to £'^, i.e. S'2 is a parallelogram (by Corollary QJ. However, it 
is impossible for all 2-dimensional sections of the unit ball through the origin to be parallelograms. Thus 
we may always choose an X2 so that the fourth point is outside X2 to obtain a non-coplanar equilateral 
set of four points. 

One would hope that ideas similar to the above proof would help to prove Conjecture El However, 
the above proof cannot be naively generalized, as the following example of Petty (52] shows. 

Define the following norm on M": 



||(xi,...,x„)|l := \xi\ + yxl H h; 



Thus ll-ll is the ^i-sum of M and f^^^: (R", ||-||) = R ®i i^~^. Its unit ball is a double cone over an 
(n — l)-dimensional Euclidean ball: 

B||.|| =conv(BJ"^U{±ei}). 

For any n > 2, (R", ||-||) contains a maximal equilateral set of four points. In fact the largest size of 
an equilateral set containing ±ei is 4. First of all note that 

||a; — eill = ||a; + eill <=^ xi = 0. 

Secondly, since ||ei — (— ei)|| — 2, we want ||a; ± ei|| = 2. Since then xi = 0, we obtain that 1 -f 
a/xI -I- • • • -I- x^ = 2 must hold, which gives that x £ §"^^. Clearly there are at most two points on the 
Euclidean (n — 2)-dimcnsional unit sphere at distance 2 (and two such points must be antipodal). 

On the other hand any n equilateral points in the (n — l)-dimcnsional Euclidean subspace xi = can 
always be extended to n -I- 1 equilateral points. For example, if we choose the n equilateral points on 
§"~^ then Aci will be the {n + l)st point, for some appropriate A G R. 

Problem 5. Calculate e{X) for the Minkowski space in the above example. 



3.2 r with I <p<2 



We sketch the proof that e(^") > n + 1 for any 1 < p < 2 with n sufficiently large (depending on p). 
Note that the four ±l-vectors in R'^ with an even number of — I's, i.e., 



"l" 




" f ■ 




"-1' 
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f 




-f 
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-1 


1 




-1 




-1 




1 



(3) 



is an equilateral set for any p-norm. Thus if we consider R^ to be the direct sum M'^ < 
above four vectors in each copy of R'^, we obtain the following 8 vectors in R^: 



and put the 



(4) 



Then it is easily seen that these 8 vectors are equilateral iff 2 • 2^^ = 6 ■<=^ p — log 3/ log 2. 

The combinatorial reason why this worked is that any two of the vectors in Q differ in exactly two 
positions. For a generalization we use Hadamard matrices. An Hadamard matrix is an n x n matrix H 
with all entries ±1 and with orthogonal columns. By multiplying some of the columns of an Hadamard 
matrix one obtains an Hadamard matrix with the first row containing only I's. If we remove this first 
row, we obtain n column vectors in R"~^ with ±1 entries such that any two differ in exactly n/2 positions. 
The vectors in ^ was obtained in this way from a 4 x 4 Hadamard matrix. We may now construct 2n 
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vectors in 



p2n-2 



as before that will be equilateral iff 



-2P = 2n - 2 



P 



le 



1 



2 n 

The only problem remaining is to find Hadamard matrices. It is easy to see that for an n x n Hadamard 
matrix to exist, n must be divisible by 4. It is an open problem whether the converse is true. However, 
the following well-known construction gives a Hadamard matrix of order 2*^: 



Ho — [l] , Hk+i 



Hk 
Hk 



Hk 
-Hk 



fc> 0. 



Thus we have e{ip) > n+ 1 for p arbitrarily close to 2. 

With a more involved construction the following can be shown: 

Theorem 4. For any 1 < p < 2 and n> 1, let 

"log(l 



k^ 



2P-2y 



Then 



e(C) > 



log 2 

2k+i 
2^+1- l' 



In particular, if n > 2'"'+^ — 2 then e{£p) > n + 1. 



l+o(l) 
(log2)n- 



Corollary 2. e{e^) >n + l if p < 2 - 

Compare this with Corollary O in Section IT!^ The smallest dimension for which the theorem gives an 
example of e(£" ) >n+lisn = 6 (and 1 < p < log 3/ log 2). In f^Tj there is also a 4-dimensional example 
(with l<p< log 1/ log 2). 

In the next two sections we show that e{X) goes to infinity with the dimension of X (Brass-Dekster), 
although the best known lower bound for e{X) is much smaller than linear in n. There are three 
ingredients to the proof: The Cayley-Menger theory of the embedding of metric spaces into Euclidean 
space, Dvoretzky's theorem, and the Brouwer fixed point theorem. 



4 Cayley-Menger Theory 

In this section we discuss a fragment of the Cayley-Menger theory by giving necessary and sufficient 
conditions for a metric space of size n + 1 to be embeddable as an affinely independent set in £2 ■ The 
general theory is by Menger [23 ; see also Blumenthal .8, §40] . 

4.1 The Cayley-Menger determinant 

Let a metric space on n + 1 points po, . . . ,pn be given, with distances pij = d{pi,pj). We now derive a 

necessary condition for pq, . . . ,pn to be isometric to an affinely independent subset of £2, in terms of pij. 

So we assume that {po, . . . ,p„} is an affinely independent set in £2 with ||Pi — Pj||2 — Pij- We write 



(1) J2) 



the coordinates of pi as {p- ' , p. 
Pi. Then 



Pi )■ Let A = convjpo 



± vol(A) 



1 
n! 



v'^' 

P'^' 



(n) 

Pi) 



P'? 



(n) 
Pi 



p„} denote the simplex with vertices 



(2) 



„(») 



Po Pi 

1 1 



Pr 



Squaring (and adding an extra row and column), 



vol(A)2 



(n!) 



W 



dot 



P\] 
pf 



„tr 



Po • 


• Pn 





1 • 


• 1 





• 


• 


1 



(n!) 



1^2 



det 



Po 

pT 



Pn 






1' 







1 







1 




1 


0_ 





Po ■ 


■ Pn 





1 • 


• 1 





• 


• 


1 



(interchange last two 
columns of first matrix) 



(n!)2 



(Po,Po) 
(Pi,Po) 



\Pn,P0) 

1 



1 



2"+i(n 



1^2 



(Po,Pi) 
(Pi,Pi) 



(Pri,Pl) 
1 

-Pai 



(Po,P«) 
(Pi,P«> 



(Pri,Pn) 
1 



PO71 



-pIo 



PnO 



rn,n— 1 



(use2(pi,pj) = 

(pi.Pi) + {Pj.Pj] 
subtract multiples of the 
last row and column) 



Pij and 



(-1)"+! 

2"+i(n!)2 



det 



P 
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where P = [pf^] . We call 



CMdet(po,...,Pn) 



P 



1 



1 

1 



the Cayley-Menger determinant of the (n + l)-point metric space. We have shown the following. 

Proposition 2. // a metric space on n + 1 points can be embedded into £2 ^■^ '^'^ o,fP-nely independent 
set, then its Cayley-Menger determinant has sign (— 1)""*"^. (Also, if the metric space can be embedded 
as an affinely dependent set, then its Cayley-Menger determinant is 0.) 

Conversely we have 

Theorem 5 (Menger j27l). A metric space on n + 1 points pq, . . . ,pn can be embedded into £2 as an 
ajfinely independent set if for each k = 1, . . . ,n, the Cayley-Menger determinant of po, . . . ,Pk has sign 



Proof. For n — I, 



CMdet(po,Pi) 






Poi 


1 


pit) 





1 


1 


1 






= Poi + pIo > 0- 



Thus the theorem is trivial for n = 1 (and even more so for n = 0). 

Thus we may assume as inductive hypothesis that the theorem is true for n — 1. Thus po, 



,n-l. 



can be embedded into £3" C £2 as an affinely independent set, say po 1-^ o and pi i~* Xi, i = 1, . 
Then span{a;i, . . . , a;„_i} = £2" and Xi, . . . , a;„_i are linearly independent. 

We have \\xi\\2 = Pio, and \\xi — Xj\\2 — Pij for 1 < i,j < n — 1, and we have to find an a; ^ ^2 "'^ 
satisfying 

\\x\\2=P0n (5) 



and 



\x- Xi\\2 = Pin for aU i = 1, . . . ,71- 1. 



(6) 



We write x — v + Ae„ uniquely, where v £ £2 ^ and A e R. If we square (jHl and simplify, we find that 
V has to satisfy the following n— 1 linear equations in £2^^: 



2(xi,v) = \\x\ 



2 _ 2 ■ _ 1 

Poi Pin : * — -L I 



1. 



Since the Xi are linearly independent, there is a unique solution v E £2 to the following modified 
equation: 

2{x„v) =pln+Ph-pL, i = l,...,n-l, (7) 

and we will have found the required x once we can satisfy (jSJ, which is equivalent to 



A^ 



pL 



Thus it remains to prove that Pq„ — ||t>||2 > 0. To this end we calculate the Cayley-Menger determinant 
of o, a;i, . . . , a;„_i, t>, which is by Proposition |2 





Pw 



o^ 

11^.112 



pIi 



\V-Xi\\i^ 

1 



2 
Ptn-l 




ll^lll 

\\v — Xi\W 


1 
1 




1 


11 


\\v-Xn-l\\l 



1 


1 
1 





= 0. 
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Multiplying out 



pin + Pi 



(by©) 



and using the last row and column to eliminate \\v\W 



Pw 



Poi 



Pn-1,0 
Pn,Q Pn,l 

1 1 



2 
P0,n-1 

Pln-l 



Pn,n-1 



Po„, we obtain 



2 
POn 

pin 



Pn-1.' 



-mAi-pin) 



= 0. 



This determinant differs from CMdet(po, ■ • ■ ,Pri) only in the second last column. Thus we may subtract 
this determinant from CMdet(poi • ■ • 7Pn), which we are given has sign (—1)"+^, to obtain the following 
determinant of sign (—1)"+^: 







Pn-1,0 
Pnfi 



pIi 



pI,i 



2 
P0,n-1 

Pl,n-1 



rn.n 



1 1 

2(l|t;|l2-Po„)CMdet(po, 



1 

, 2{\\v\\l-plJ 1 

1 1 



, p„_i) (expanding along 2nd last column). 



Since it is also given that CMdet(poi • ■ • ,Pn-i) has sign (—1)", it follows that ||f II2 ^ Pq„ < 0. This gives 



us a A € R such that x = v + Ae„ satisfies (jJJ as well. The theorem is proved. 



D 



4.2 Embedding an almost equilateral n-simplex into £2 

We now want to use TheoremElto show that an n-simplex with all side lengths close to 1 can be embedded 
isometrically into £2- Thus we have to find the sign of the determinant of a matrix with on the diagonal 
and all off-diagonal entries close to 1, since this is how the Cayley-Menger determinant will look like. 

We start with the following simple observations. Let J be the n x n matrix with all entries equal to 
1. 

Lemma 6. Let n> I. 

1. The characteristic polynomial of J is (— l)"(a; — n)x"~^ . 

2. Therefore the characteristic polynomial of J — I is (— l)"(a; — n + l)(x + 1)"^"'^, and det(J — /) = 
(-l)"-i(n~l). 

3. Therefore J— I is invertible, and its inverse has eigenvalues —1 with multiplicity n—1, and l/(n — 1). 
Proposition 3. Let A — [a,;j] be an (n + 2) x (n + 2) matrix with Oi^ = 0, |1 — aij\ < l/{n — |) for 



^l^ij^jl^n+l, and ai^n+2 



J-n+2A 



1 for i = 1, . . . ,n. Then det{A) has sign (— 1)"+^. 



Proof. We first prove that A is invertible. Let E ~ [e^ ] := J— I— A. Then en — and ei^n+2 = £n+2,i ~ 
for alH, and \e,j\ < !/(«-§) for aU l<i^ j <n + l. Since ^ = J-I -E= {J - I){L - {J - I)-^E), 
it is sufficient to prove that ||(J — I)^^E\\2 < 1, where ||-|J2 is the operator norm on L(^2 ~^ ^2)- 
Firstly, since the operator norm is the maximum modulus of the eigenvalues, we have by Lemma |S1 that 
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|1(J- /)-i|l2 ^ 1. Secondly, by the Cauchy- Schwartz inequahty, \\E\\2 < ^ E"ji^i 4 < 1- Thus A is 
invertible. 

To show that detA has sign (—1)"+^, observe that we may join J — I to A with a curve At, < t < 1, 
where each At satisfies the hypothesis of the proposition. Thus we know from what we have just proved 
that each At is invertible. By continuity considerations the sign of det At does not change. Therefore, A 
and J — I have the same sign, which is (—1)"+^ by Lemma|Sl D 

Theorem 6 (Dekster & Wilker [14j ^. Let p^, . . . ,pn be a metric space with distances satisfying 

l<d{p,,pj)<l + —^. 

Then po, . . . ,p„ can be embedded into (,2, and any such embedding must be an affinely independent set. 

Proof. We first scale the distances by some a > in such a way that the given inequality is transformed 
into 

!-£< {ad{p,,p,)f<l+e 

for some e > 0. Thus e and a must satisfy 

— r— = 1 and — TT- 



n + l 
Eliminating a we find 

We want to apply Theorem|Sl Since the CM determinant of poj • ■ ■ , Pfe is a (fc + 2) x (fc + 2) matrix with 
A; < n, to show that it has sign (—1)'^+-'^, it is sufficient to prove that 



according to Proposition |3| By 101 we have 





«+!' 




i have 






(se)'-! 






f;i?V^i 






(n + 2)2 - (n 


+ 1)2 


2n + 3 


(n + 2)2 + (n 


+ 1)2 2n2 + 6n + 5 


2n + 3 


2 




2^2 + 6n + 1 


2n + 3' 





< 

By Proposition 121 no embedding can be affinely dependent. D 

Dekster and Wilker found the smallest 7„ > 1 such that whenever 

1 < d{pi,pj) < 7„ 
then po, . . . ,Pn+i can be embedded into £2- it is 



^+m 


if n is even. 


^+^ 


if n is odd. 



7n = 

Their proof is geometrical. The above analytical proof gives values that differ from 7„ by 0(ji~^) as 

n — > cx). 
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Corollary 3 (Schiitte |30| ). For any pi, . . . ,Pn+2 G ^2 ™fi have 

maxllp, -pjlla ^ _^ , 1 



mmi^jWPi - Pj\\2 n- 

Schiitte in fact determined the smallest (5„ > 1 such that 

max\\pi-pj\\2 >-^_^^ 



miUi^jWPi -Pj\\2 

for all possible Pi, ■ ■ ■ ,Pn+2 & ^2- it is Sn = 7n+i- This result was also discovered by Seidel [3T], and 
follows from the embedding result of Dekster and Wilker ^|; see j^ for a very simple proof. 

Corollary 4. Ifd{X,il^) <1 + :^, then e{X) <n+l. 

Corollary 5 (Smyth [33 ). // \p - 2\ < ^^^j^ then e{i^) =n + l. 
Compare this with Corollary |21 in Section IX^ 

5 The Theorem of Brass and Dekster 

Theorem 7 (Brass | 1U| &: Dekster |13|^. An n-dimensional Minkowski space contains an equilateral 
set of size c(logn)^'^ for some constant c > and n sufficiently large. 

Both Brass' and Dckster's proofs use Dvoretzky's theorem combined with a topological result. We 
follow Brass' proof, which uses only the Brouwer fixed point theorem. 
For the proof we need Dvoretzky's theorem. 

Dvoretzky's Theorem. There exists a contant c > such that for any e > 0, any Minkowski space X 
of sufficiently large dimension contains an m-dimensional subspace with Banach-Mazur distance at most 
1 + e to i^ . We may take dimX > e'^'"'^ for some absolute constant c > 0. 

The estimate for dimX, which is best possible (up to the value of c), was proven by Gordon \W\ . 

Proof of Theorem^ Denote the n-dimensional space by X. Let m = ci(logn)^/'^ and e = l/(m + 1). A 
simple calculation gives that e'^"^^ < n if we choose ci = ci(c) sufficiently small. Dvoretzky's theorem 
gives that X contains an m-dimensional subspace Y with Banach-Mazur distance at most 1 + ^^^i-j- to 
^™ . The proof is then finished by applying the following theorem. D 



Theorem 8 (Brass [10 &: Dekster |13p. Let X be an n-dimensional Minkowski space with Banach- 
Mazur distance d{X,(,2) ^ 1 + ;jXT' Then an equilateral set in X of at most n points can be extended to 
one of n -\~ I points. In particular, e(X) > n -\- 1. 

Proof. By induction on nested subspaces of X it is sufficient to prove that a 1-equilateral set pi, . . . ,Pn 
may be extended. We may assume that a coordinate system has been chosen such that 

||a;|| <||a;||2<(l + £)||a;||, 

where e = {n + 1)^^ . Let pi, . . . ,pn be a f -equilateral set in X. We want to find a p & X such that 
Up ^ Pill = 1 fo'' alH = 1, . . . ,n. 
We have 

1 < \\Pi — Pj\\2 < 1 + £ for all distinct i,j. 

By Theorem |S1 pi , . . . ,p„ are affinely independent. For any pi,. . . ,pn G [1, 1 -I- e] there exists a point 
X E X with II a; — Pi II 2 = Pi, i — I,. . . ,n, by Theorem El Furthermore, by fixing a half space bounded 
by the hyperplanc through pi,...,p„, there is a unique such x in this half space (by simple linear 
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algebra arguments as in the proof of of Theorem|ni exercise). We write x —: f{pi, . . . ,Pn), and define 
(f : [0, £]" -^ [0, e]" by sending {5i,..., S„) i-^ (yi, . . . , ?/„), where 

yi=^ Si + 1- \\f{l + Si,...,l + Sn)~Pi\\, i^l,...,n. 

Since 1 + 6i < 1 + e, each yi is well-defined. Also, 

y. < S, + l-{l+e)-^\\f{l+Sl,...,l+Sn)-p^\\2 

= S, + 1 - (1 + e)-\l + S^) 

< (l-(l + e)-i)(l+e)-e, 
and 

y^ > S, + l-\\f{l+Sl,...,l+Sn)-p^\\2 

which shows that (p maps into [0, e]". 

By Brouwer's fixed point theorem, (p has a fixed point ((5i, . . . , (5„), which gives 

11/(1 + (5i,..., 1 + ,5„) - P.ll = 1 foraUi = l,...,n. D 

Corollary 6 (Brass ^ & Dekster ^). IfdiXJ"^) < 1 + ;^ i/ien e(X) =n + l. 
Proof. Combine the above theorem with Corollary 01 D 

6 The Linear Algebra Method 

Linear algebra provides a very powerful counting tool. The basic idea is that whenever one has a set 
S' of TO elements, and one wants to find an upper bound to to, one constructs a vector Vs in some TV- 
dimensional vector space for each element of s G S*. If one can prove that the vectors constructed are 
linearly independent, then one immediately has the upper bound to < TV. If the vectors are not linearly 
independent, then one may try to prove that the square matrix A^'^A has a high rank, where A is the 
matrix with column vectors v^^ s £ S. 

6.1 Linear independence 

As a first example, one can prove that 6(^3) < "- + 1 as follows. Let S C £2 be 1-equilateral, and define 
for each s d S a polynomial in n variables 



Ps{x) ^ Ps{xi,...,Xn) -.^l-Wx- s 



2 



Then it is easily seen that {1} U {Ps{x) : s e S*} is linearly independent in the vector space M[xi, . . . , a;„] 
of polynomials in xi, . . . ,Xn with real coefficients, and secondly that all Ps{x) are in the linear span of 
{1, J2^=i 2^1; 2;i, . . . , x„}. It follows that 1 -I- l^l <2 + n, which is what was required. 

In the same way one may prove that e(£") < 1 + (p — l)n if p is an even integer (an observation due 
to Calvin; see [33 )• With some more work the following may be shown: 

Theorem 9 (Swanepoel [37 ). For each n > 1, e(^2) = n + I. 

Proof. The lower bound has already been shown in Section ll. 21 

Let 5' be a 1-equilateral set in £J. For each s G S, let Pa{x) = Ps{xi, . . . ,Xn) be the following 
polynomial: 

Ps{x) := l-\\x-s\\l 

71 n n n 

i—l -i— 1 i—1 i—1 
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Thus each Pg is in the hnear span of 

n 

{l,^xf}UK^:i = l,...,n;fc = l,2,3}, 

which is a subspace of M[a:;i, . . . , x„] of dimension 2 + 3n. Since Ps{s) — 1 and Ps{s') = for aU distinct 
s,s' G S we have that {Ps : s e S*} is hnearly independent. (So aheady |S'| < 2 + 3n.) 
We now prove that 

{Ps : s e S} U {1} U {x'^ : i = 1, . . . ,n;k ^ 1, 2} 

is still linearly independent, which will give 15*1 + 1 + 2n < 2 + 3n, proving the theorem. 
Let As {s e S), A, Ai, ^i (i = 1, . . . , d) G R satisfy 

n n 

^XsPs{x)+Xl+^X^X,+J2^^^^^^=0 Vxi , . . . , 2;„ € K. (10) 

s£S i=l i=l 

Substitute © into (fTn| : 

(- E ^^) E ^' + 4 E(E ^«^o^f + E(m^ - 6 E ^«^'K' 

n 

+ ^(A.+4^4)a;. + ^A.-^A.||s||4 + A = 0. 
j=i ses ses ses 

Since the above is a polynomial in R[xi, . . . , Xn] which is identically 0, the coefficients are also 0: 

E^« = 0' (11) 

(12) 
(13) 
(14) 

_ (15) 

ses 

By substituting x = s £ S into H10|) we obtain 

n n 

Xs + X + Y, -^^^^ + E /^^^^ = VseS. (16) 

Now multiply (|16(l by A^ and sum over all s G 5: 

^ A2 + A ^ As + ^ A,;(^ AsS,;) + 5] A*.(5] Ass2) = 0. 
ses ses i=i ses i=i ses 

Then use (|ll|l -()13 |l to simplify this expression as follows: 

Ea^ + ^Em?-o- 

ses i=i 

It follows that all As = and all fii — 0. From (fTl|l it follows that all Xi = and from ifT^ that A = 0. 
Thus we have linear independence. D 
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ses 






ses 


= l,...,n, 




/Xi - 6 ^ Xgs'f = 
ses 


Vi = 1, . . 


. ,n, 


A, + 4 ) ; Xssf = 

ses 


Vi = 1, . . 


.,d, 


^+E^«-E^«i 

ses ses 


|s||| = 0. 





In the same way the fohowing may be proved: 
Theorem 10 (Swanepoel j37j ) . For p an even integer and n > 1 we have 

efrX 1^2 -1)" + 1 'fP^O (mod 4), 
li'^+l i/p=2 (mod 4). 

6.2 Rank arguments 

The following lemma is very simple, yet extremely powerful. 

Rank Lemma. Let A — [aij] be a real symmetric n x n non-zero matrix. Then 



rank A > 



En 



Proof. Let r = rank A, and let Ai, . . . , A^ G M be all non-zero eigenvalues of A. Then J^i O'U — trace(A) = 
J2l=i ^i- Also, J2i j ^Ij — trace(A^) — J2l=i ^h since A^ has non-zero eigenvalues Af . Therefore, 

by the Cauchy-Schwartz inequality. D 

Corollary 7. Let A — [aij\ be a real symmetric n x n matrix with an = 1 for all i and \aij\ < e for all 

i ^ j . Then 

n 

rank A > -, ^-^ . 

- 1 + (n - l)e2 

In particular, if e = n^^'^ then rankA > n/2. 

Also, if e < l/{n — 1) then the corollary gives that A is invertible, which also follows from the fact 
that A is then diagonally dominated. 

The rank lemma together with a simple rounding argument gives an upper bound for e(ff ), 1 < p < oo, 
that is polynomial in p and n. This is due to Alon (Smyth, personal communication). 

Theorem 11. For some constant c > we have that for 1 < p < oo and n > 1, e(£") < cp^n^+^'P. 

Proof. Let pi, . . . ,Pm be 1-equilateral in £". Since \pl — p, \ < 1 for all i ^ j and all fc = 1, . . . , tti, we 
may assume after a translation that all pl ' e [0, 1]. Choose an integer N such that 

(e - l)pn^/Py/m < N < N + 1 < epn^^Py/m. 

(The interval pn^^P^/m > 2 since m > 4 without loss of generality.) We now round each pi to a point 
in the lattice {0, ;^, ;^, . . . , ^}" as follows. Let q] ' be the integer multiple of 1/A^ nearest to p] ' , say 
1i — N where 1 < d{i, k) < N + 1 (if there is a tie, choose arbitrarily). Let qt — {ql , . . . , g.j ), 
and let Q be the m x m matrix [l — Hq; — 9j"||p] . .. Thus Q is an approximation of the identity /„i7 and 

we now estimate how close. The diagonal contains only I's. For each fc, \pi — ql \ < 1/2N, and 

Wqi-QjWp < \\Pt-Pj\\p + \\Pi-Qi\\p + \\Pj-Qj\\p 
< 1 + n^/P/N, 

and similarly, ||q,, - qj\\p > 1 - n'^/P/N. Thus 

i/p\P / 1 \P 1 1 



^-^^■ii^n^^Vj n^-^i^^Tw^) <^'^-^'^<^+7^ 
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since e^ < l + (e-l)x for < a; < 1. It follows that l-\\qi-qj\\Pp > -l/V^. Since {1 + x)p + {1 - x)p > 2 
for < a; < 1, 

i-k.-,,ii!;<i-(i-— ) <(i + — ) -'<7s^ 

Thus by Corollary [7| rankQ > m/2. Note that Q — X]fc=i Q'^' where 



i-\J''^ -J^hp 



i,j = i 



Define the {N + 1) x ?ti matrix 

^k = [^d(l,k) ^d{2.k) ' ' ' ed(rn,k)\ , 

where ei, . . . , Bn+i is the standard basis of M^+^ (recall ql = '^~^ )■ Secondly, define 

^ in \ N N \ Ji,i = l ■ 

Then it is easily seen that Qk — A^j^BAk- It follows that rankQ^ < A^ + 1, hence 

n 

rankQ < ^ rankQ^ < n{N + 1). 

k=l 

Putting the upper and lower bounds for rank Q together we obtain 

to/2 < n{N + 1) < epn^+^/P^, 
and TO < 4e^p-^n^+^/P. D 

7 Approximation Theory: Smyth's approach 

In the proof of TheoremE]we essentially approximated the function fp{t) — \t\P , t E [—1, 1] uniformly 
by step functions (look at the definition of the matrix B in the proof). For p > 1 the function fp is 
differentiable [p\ times, so one would expect that there are better uniform approximations to fp using 
polynomials or splines instead of step functions. This is indeed the case, and can be used to improve 
the bound of Theorem E] Smyth |SS1 used the theorems of Jackson from approximation theory to find 
the first non-trivial upper bounds for e{£p). Alon and Pudlak ^ improved his bounds using the rank 
lemma. We state the theorem of Jackson that is needed (called Jackson V) and then prove their result. 
The proofs of the Jackson theorems may be found in many texts on approximation theory, e.g. (Tij . 

Jackson V. There is an absolute constant c > such that for any / G C[— 1, 1] 

1. there exists a polynomial P of degree at most n such that 

\\f-P\\oo<CLuif,l/n), 
where uj{f, 5) := sup{|/(a;) — fiy)\ '■ \x — y\ < S} is the modulus of continuity of f , 

2. and if f is k times differentiable and n > k, then there exists a polynomial P of degree at most n 
such that 

11/ - ^lloo < r—jvi — r^^ — rT^^(^^'^' I/")- 

(n + l)n(n — 1) . . . [n — k + 2) 
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Lemma 7. For each 1 < p < oo there exists a constant Cp > such that for any d > 1 there is a 
polynomial P of degree at most d such that P(0) — and 

m^ - P{t)\ < ^ /or a/He [-1,1]. 

We have Cp < {cpY for some universal c > 0. 

Proof. By choosing Cp sufficiently large we may assume that d > 2p. Then fp{t) := \t\P satisfies / G 
C('=)[-l, 1] with k=\p']-l, and 



/(rpi-i)(f) = sgn^p^^\t)p{p -i)...{p-\p-]+ 2)\t\ 



p+1- \p-] 



Thus 



From Jackson V we obtain a polynomial P of degree at most d with 

cfpl-i 



\fp-P\\ 



< 



< 



{d+l)did^l)...{d- [p-] +3) 



p(p-i)...(p-bi+2)(i/df+i-rpi 






c 

If we subtract the constant term from P wc obtain a polynomial Q with Q(0) = and ||/p — Q||oo < 
2c/ dP. D 

Smyth |33| approached Kusner's problem with the idea of approximating the p-norm with polynomials 
using Jackson's theorems. He obtained an upper bound Cpn*^P+^^/(P^^) for 1 < p < oo using linear 
independence. The next theorem was proved by Alon and Pudlak by combining Smyth's approach with 
the rank lemma. 

Theorem 12 (Smyth (231, Alon & Pudlak [Tl). Forl<p<oo, e{e^) < c^n(2P+2)/(2p-i). We may 
take Cp — cp, with c > and absolute constant. 

Proof. Let pi, . . . ,p„i € £" be 1-equilateral. Let c — max(cp, (2^/^ — 1)"''), where Cp is the constant from 
LemmaO We fix an integer d such that cn^fm < dP < 2cny^ (possible since c > {2^/p — 1)^''). Let P 
be the polynomial from LemmaEI Define the m x m matrix A — [ay] by a^ = 1 — X]fc=i ^iPi ~ p) )■ 

Since X]fc=i ^iPi ~ p) ) i^ ^^ approximation of \\pi — Pj\\^, the matrix A is an approximation of the 
identity /,„. Again we estimate how close. Firstly, since P{Q) — 0, all an = 1. Secondly for any i ^ j, 

n 



P) ) 



fc=i 



Ak) 



fv 



fc=l 
dP Jm 



(fe) 



-vV) 



by choice of d. By CoroUaryElwe have rankA > m/2. 

We now find an upper bound for the rank of A. For each i — 1,2, 
variables 

n 

P,(a:i,...,a;„):=l-^F(pf 
fc=i 



, TO, define the polynomial in n 



Xi). 
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Thus aij = Pi{pj). Each Pi is in the hncar span of 



/c=l 



Thus the m polynomials Pi lie in a (2 + (d — l)n)-dimensional subspace of polynomials. Then the ith 
row vector of A, [Pi{pi), . . . , Pi{pm)\ , is in the linear span of 

{{f{Pi),---J{Pm)).feV}, 

which is a subspace of MJ^ of dimension at most \V\ = 2 + {d — l)n < dn. It follows that rank A < dn. 

Putting the upper and lower bound for rank A together, we find m/2 < dn. Since d^ < 2cny^, we 
obtain to < c^n(2p+2)/(2p-i) . n 

In the next section we find the best known upper bounds for e(£"), due to Alon and Pudlak. 

8 The best known upper bound for e{ii) 

Alon and Pudlak pP proved that e(C) < Cpnlogn if p is an odd integer. This matches the lower bound 
of 2n apart from the log n factor and the constant that depends on p. We here present their proof for 
the case p = 1. The proof for other odd p is simple once this case is understood, see jTJ for the detail. 

Theorem 13 (Alon and Pudlak [IJ). e(^") < crt logn. 

Proof. Let pi, . . . ,pm+i be 1-equilateral in £". After a translation we may assume Pm+i = o. Thus it is 
sufficient to prove m + 1 < en log n given that 



Ebfl - 1 Vz = l,...,TO, (17) 

n 

and ^Ipf^l = 1 yi<i^j<m. (18) 



We first show that by doubling the dimension we may assume that all pi > 0. We replace each p\ by 

r(pf\o) ifpf >o, 

\(0,-pf) ifpf <0. 

Now the points pi, . . . ,Pm are in If^, they are still 1-cquilateral, \\pi\\i = 1, and now all p.- > 0. Thus 
we may assume together with ()17|) and (|18|l that p^ > holds. (Once we have proven ni < en log n with 
this assumption, then to < 1 + c(2n) log(2n) < 3cnlogn in the general case.) 
We now express \pl — Pj \ in terms of min(p^ ',pj'). Since 

|a — 6| = a + b — 2 min(a, b) for a,b &R, 
and X]fe=i?'i — 1, we obtain 



/m= [i-iiPi-pjii]r, = i 



-l + 2^min(pf',pf) 



ij=i 



We now want to approximate min(a, 6) by an inner product (a, b) where a,b E R^. Then /^ will be 
approximated by 

A-- 



-l + 2((pW,...,p|")),(pW,...,pf): 
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''j=i 



Then rank A < 1 + Nn, since it is a linear combination of Jm of rank 1 and a Gram matrix of vectors 
in M . The approximation will be good enough so that the rank lemma will give rank^ > cim. Thus 
cim < Nn+ 1, and we will have to make sure that N < C2 logn. What will in fact happen is that N will 
be different for each coordinate fc = 1, . . . ,n, say Nk, and we'll have to make sure X]fc=i ^k < C2n\ogn. 
The rough idea for approximating min(a, 6) by an inner product is the following. For each x € [0, 1] 
define 

^=(:^-----:^^0,...,0)eR^. 



^Vlv' 



Vlv' 



[a;A^J times 



Then for < a < 6 < 1, 

(a, b') ~ jj lo,N\ = a — e = min(a, b) — e, 

where < £ < 1/A^. Let us see how far this brings us already. Since 



^mi 



(fc) Jk)\ 



mm Vpl , p 



fe=i 



•^(1) 



(p: 



,^)M 



,^] 



< 



N' 



the error by which A approximates Im is 



Sii < — 

'^ - N 



if we choose TV — 2nypm. By the rank lemma, rank A > (1 + o(l))m/2. Also, rankyl < Nn + 1. Putting 
the bounds together, we obtain m < cn^, already obtained in Theorem 1111 

We now fine-tune this idea. For any partition T' = {0 = uq < wi < • ■ • < ujv_i < ujq ~ 1} of [0, 1] 
into N intervals, we define for any x € [0, 1] the vector x-p E K^, where 



.0") 



x-ut-i 

y/Ut—Ut-1 ' 

10, 



3 >t, 



where t is such that x G {ut-i,ut). 



Remarks. 

1. We'll choose a different partition for each coordinate k — 1, 

and ({p\ , . . . , Pj ), {Pj , . • • , Pj ) ) are sums over /c = 1, . . 



. . , n. Since both J2k^™^^iPi 'Pj ) 
, n, the error in each coordinate will 



also sum up, so we first do the analysis in a single coordinate. 
2. We'll choose the partitions so that a coordinate will never hit an endpoint of any (uj_i, Uj). 
With the above definition we find for < a < 6 < 1 that 

{a-p,b-p) 



(V^I7~0)2 + (Vu2 - wi)2 + {^ut-i~ut^2f + a^r^b^^\ 



where a € (uf_i,Wf), 




if 6^ {ut-i,ut), 



a—Uf — i I 



{min(a, b) if a and b are in different intervals of V, 

Thus there is no error if a and b are in different intervals. However, when a and b are in the same interval 
of V, the error is bad, and we want to get rid of it. We do this by adding another N coordinates: Define 



x-p 



X - Ut-l 
yJUt - Ut-l 



et e 



bJV 



with t such that x G {ut~i,ut). 
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Then 



/^\J0 if a and b are in different intervals of V, 

li a,b e [ut-i,ut). 



ut — ut-i 



Since we want to subtract this, wc let the inner product on M ® K. be 

{{xi,X2),iyi,y2)) = {xi,yi) - {x2,y2) ■ 
We then obtain for < a < 6 < 1 that 

inin(a, b) if a and b arc in different intervals of V, 



{a-p,a-p),ibv,b-p)) = , -r , r ^ 

/ [uf_i it a,6 e {ut-i,ut). 

Thus if a and b are in the same interval, the approximation is a systematic rounding down. We now add 
a third vector in M.^ to improve the approximation in this case. We do this by randomized rounding: 
For each interval {ut-i,ut) we choose randomly and uniformly r G {ut-i,ut) (a threshold), and for any 
X € (wt_i,Mt) we define 

fo ifx<T, 

a;p = < 

[^ ^ut - ut-iSt ii X > T. 

Note that x-p is a random variable. Thus if a and b are in different intervals then (a-p, b-p) — 0, and if 
a and 6 are in the same interval {ut-i,ut), then (as with x-p) 

/~ r \ fo if min(a, 6) < T, 

( a-p ,o-p) = <. 

\ / \ut — Ut-i if min(a, 6) > r. 

We now let the inner product on R^ ® M^ ® M^ be 

{{xi,X2,X3),{yi,y2,y3)) = {xi,yi) - {x2,y2) + (0:3,2/3), 

and for each x e [0, 1], let 

x-p — {x-p, x-p, x-p) e M^^. 

Then for any a, 6 G [0, 1], 

min(a, b) if a, b are in different intervals of V, 

(a-p,b-p) = I \ut-i, if min(a, 6) < r 

.„ . . ,^ , if a,6e (Mt_i,ut), 

Mt, II mm(a, b) > t 

and the error X — min(a, b) — {a-p, b-p) (depending on V , a, b) satisfies 

{0 if a, 5 in different intervals, 

a — Ut-i if min(a, b) < t, a,b E (mj-i, Ut), 
a — Ut if min(a, b) > t, a,b E {ut-i,ut). 

Thus 

\X\ <ut- ut-i = length(ut_i, Ut), 

the length of the interval. We now calculate the expected value and variance of X . Without loss of 
generality a <b. Then easily 

E{X) = Prob(a < T)(a- ut_i) + Prob(a > T)(a -ut_i) 

= (a-Ut-i)-\ (a-ut) 

Ut - Ut^l Ut - Ut^l 

= 0, 
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and 



E{X^) = Prob(a< r)(a-Ut_i)2 + Prob(a > T)(a-ut_i)2 



ut~ a 



■{a-ut-i) + 



2 a-ut-i 



_ (g- ut-i){ut -q) 

Ut - Ut-l 

= {a - ut-i)iut - a) 
1 



(a- Mt)^ 



(a- Mt_i - a + ut) 



< ^(wt - ut-i)^, since Ut_i < a <ut. 

For each coordinate fc = 1, . . . , n, we'll (soon) choose a different partition Vk of [0, 1] into Nk intervals, 
with an independent random threshold for each interval in each Vk- For each cc = (.x(l), . . . , x{n)) we let 



x = ix(l)^^,...,x(n)^J e 



p37Vi 



p3Af„ 



Thus for each pi we now have a Pj = (p^ , • • • ,Pi ) G K'^^'==i ^'^ (where in each coordinate a different 
partition is used, but not denoted anymore). Then we approximate the identity 



-l + 2^min(pf' , 



(fe) Jk) 



Pi 



k=l 



by 



A=[-1 + 2{p,,pX" = 



-1 + 2E r\ 



fe=i 



*,i=i 



TTiC^) Tr:('^)\ 



P 



^ I 



-I ij = l 



and we let Xi_j^k — min (p^- ' ,p- '^ — \p\ ,Pj )■ Note that {Xi_j^k '■ k — l,...,n} are independent 

random variables for fixed i,j, since Xij^k = if p^- and pij are in different intervals oiVk, or depends 
on a single threshold, with these thresholds independent as fc = 1, . . . , n. 

We want to apply the rank lemma to A, so we start estimating (^^ flji)^ and ^^ a? . First of all, 

n 

a. = -l + 2E(pf^pf^) 
fe=i 



-l + 25]min(pf,pf))-2E^M.fc 
fe=i 

n 

1^2 y ^ Xi^i^k- 



k=l 



k=l 



Similarly, for i ^ j^ 

n 
k=\ 

Since |Xi,i^fc| < length(/i;), where Ik is the interval of Vk containing pj , we obtain 

n n 



k=\ 



k=\ 
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and 



^a»» 


= 


m - 


-^i:>:^.,. 


i=l 






i=l fe=l 




> 


m - 


-2)_^>;iength(/fe) 




= 


m - 


-2) ;hit(/)length(/), 



(19) 

wehre the last sum is over all intervals of all Vk, and hit(/) is the number of p^- ' "hitting" /, i.e., the 
number of ordered pairs (z, k) with pi e /. 

Secondly, for i ^ j, afj = 'i{Y^t=i ^ij,k)^j hence i?(afj) = 4^)J^^ ^(-^ij fc) "^^^ ^° independence and 
E{Xi^j^k) = 0. Similarly, 



£;(a|) = l-4^i?(X,,,,fc)+4^i?(X5,,) 
fe=i 

n 

= 1 + 4^S(X2,_,). 



fe=i fe=i 



Thus 



ij — 1 i— 1 ^ji— 1 



2 ^ 

ay) 



m n m n 

= m - ' 



4EE^(^va)+4EE^(^'.^^) 



z^l /c^l ^J — 1 A;— 1 

m n 

"^+4EE^(^5,fc) 

ij — 1 /c— 1 
771 n 



- '^+ E y^length(/,j^fc)^, with I^j^k € Pfe such that pl\pf' G I^j^fc if 
i,j=i fe=i it exists, otherwise take length(/ij-.fc) — 0, 

^hit(/)2length(/)2, 
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where the last sum is again over all intervals. Because we have found an upper bound on the expected 

n 2 



value of X]ri=i af,7 it follows that there exists a choice of thresholds such that 



Y^ al < 777 + ^ hit(/)2 length(/)2. (20) 

Fix such a choice of thresholds. (So at this stage we leave randomness behind.) The rank lemma together 
with l^ni and (jJOJ gives 



rank A > 
if 



(777 - 2 ^j hit(/) length(/)) 
7r7 + X;/hit(/)2 1ength(/)2 



2 



777 

/ 



2 Y^ hit(/) lcngth(/) > 0. (21) 
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On the other hand, rank A < 1 + X]fc=i ^^fc = 1 + 3^^^ 1, thus 



(TO-2 2^hit(/)length(/)) 
m + ^^hit(/)2 1ength(/)2 



2 






<l + 3> 1. (22) 



(k) 

Finally, we fix the partition Vk for each coordinate k. We assume that for each fc = 1, . . . , n, all pj 

(k) 

are distinct. (This is not essential: the p] may all be perturbed by a sufficiently small amount so as to 
weaken the left-hand side of (|22|l by at most e > 0, for any e > 0.) 

We now assume that both m and n are powers of 4. If we can prove m < en log n in this case, then 
m < 16cnlogn in general, since we may round m down to a power of 4 (thus dividing m by at most 
4) , and n up to a power of 4 (thus multiplying n by at most 4) . All the logarithms until the end of the 
proof will be base 2 (and indicated as such). We first divide [0, -] into y/m/n equal parts (thus each of 

length l/i/mn). These are called base intervals. Without loss of generality no pi is an endpoint of a 
base interval, again by making an infinitesimal perturbation. 
Then start at t = 1 and let t go down until 

hit(fi, 11) length(ft, 11) > ^^^^ 

m 

or until t — 2^^^^, whichever comes first. (The constant c will be fixed later; c = 10000 is sufficient.) 
If the stopping point fi equals some p- , we go down slightly more, without increasing hit([t, 1]). Then 
start with a new interval at ii and go down until 

, . / r XX, . ,r IX en Ig n 

hit([t,ii))length([t,ii]) > ^ 

m 

or until t = 2^^'"^, whichever comes first. In general we go down until 

enlgn 



hit([i,t,_i))length([t,i,_i))> 



m 



or until t = 2^^/^, for each s = 1, . . . ,31gn. 

If hit(/) length(J) > enlgn/m, we call / a regular interval, otherwise a singular interval. 
Thus for a singular interval / we have 

en Is Ti 

hit(/)length(/) < , / singular. 

m 

For a regular interval, if the stopping point did not hit a p^ , then hit(/) length(/) ~ cnlgn/m. If the 
stopping point did hit a, p^ , then hit (/) length (/) > cn\gn/m, but 

enlgn 



(hit(/)-l)length(/) < 



m 

Thus 

en Iet n 

hit (/) length (/) > , / regular, 

m 

and 

hit(/) length(/) < ^- if hit(/) > 2 for regular /. 

m 

Later we'll have to take care of regular / with hit(/) = 1 separately: for them 

length(/) < y^ length(/) = n. 
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i: 


length(/)2 


< 


i: 


I regular 






I regular 


hit(/)=l 






hit(/)^l 



There are at most 3nlgn singular intervals. We now bound the number of regular intervals. For each 
regular interval /, let sj be the s for which / C [2-*/^, 2^(^-1)/^]. Then 

Y^ 2-''^^hit{I) < Y^ min(/)hit(/) 

regular / regular / 

< \J y^ min(/), where k is such that I £Vk, 

I i=l 

m m n 

< E E^i'^ = EE^i'^ = - (23) 

We now bound Ereguiar / 1/(2""^/^ hit(/)). For each s = 1, . . . ,31gn, 

y^ length(/) < (2^(*^i)/^ — 2^'*/^)n (since for each coordinate the /'s don't overlap) 



regular / 

Sj—S 



(2l/3 „ 1)22/3^ < 12-^/3^. 



Since length(/) hit(/) > cnlgn/m for regular /, 



1 m length(/) 



Thus 



hit(/) cnlgn 



^ 2-^/3 hit(/) - 2-^/3! Ign ^ ^°"S*^('^ 

regular / regular / 

Sj—S Sj—S 

m 

< 



3clgn 



Thus 



By Cauchy-Schwartz, 



3 Ig n 

^ 2-'^^/3 hit(/) ^ ^ Sclgn " 7' ( ' 



regular 



E - E \/2--/^hit(/)-=l 



, r / \ , r vF^^73hitm 

^ regular 1 / \ regular i V \ / 

\ regular 7 / \ regular / 

< m— by (|23) and |23, 

and the number of regular intervals is less than raj \fc. Thus the total number of non-base intervals is at 
most 3n Ign + mj ^fc (and recall that the number of base intervals is \pmn). We are now in a position to 
estimate the various quantities in 1)22(1 . We want to show that these estimates imply m < cnlgn, so we 
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assume that m > cnlgn and aim for a contradiction if c is sufficiently large (10000 will do). First of all, 

1 + 3^1 < l + Sf y/rnn + Snlgn + ^ 

3m 9m 3™ 

Vclgn c a/c 

16m 
< — ^ (25 

to be sure. Secondly, 

TO + yhit(/)2lcngth(/)2 = m+ y liit(/)2— + V length(/)2 

/ base / regular / 

hit(/) = l 



+ Yl (hit(/)length(/))2 



non-base I 
hi > 2 if regular 



2 1 I lo 1 1 "^ 1 ^2cnlgri 



< m + nm 1- n + [ Sn Ig n H p 

< 2m 



mn \ \/c I \ m 

m 12m Am 



clgn c vc 



17to 
< 2m H ^. (26) 



Thirdly, 



y hit (/) length (/) == y hit(/)^^+ y length(/) 

Z — ^ Z — ^ ^ /mil ^ — ^ 



y 77171 
base / regular / 

hit(/) = l 

+ ^ hit(/)length(/) 

non-base / 
h/ > 2 if regular 

1 /o 1 ^'^ A /2c7ilgn 

< 77771 = + 71 + 377 Ig 71 H -= 

\Jmn \ y/C 

m m 6m 2m 

< 



< 



^/cTgn clgn c ^ 
10m 



Thus if ^/c > 20, then pif) is satisfied, and in fact 



IO771 
2 ^ hit(/) length(/) > m ^. (27) 



Substituting estimates l|^ . if^ . and lf?7|) into if^ we obtain 

2 



I6771 



(— ^) . 16 (1-^)' 



> -; . 17^ . i-e-, ^ > 



v^ 2m+^ ' --'V-c^ 2+il ' 

which is false if c is sufficiently large, e.g. for y^c — 100. 

Thus 771 < en Ig 71, and the proof is finished. D 
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9 Final remarks 

9.1 Infinite dimensions 

In infinite dimensions almost nothing is known. The most important open question here is the foUowing. 

Problem 6. Does there exist an infinite equilateral set in any separable infinite- dimensional normed 
space ? 

As observed by Terenzi ^| , it foUows from the partition calculus of set theory that a normed space 
of cardinality at least 2'^ has an infinite equilateral set (in fact an uncountable one), where c ~ 2^° is the 
cardinality of the continuum. The following two easy results are proved in |H5I . 

Theorem 14. Let e > and X an infinite dimensional Banach space. Then X has an equivalent norm 
with Banach-Mazur distance of at most 2(1 +e) to the original norm, admitting an infinte equilateral set. 

Theorem 15. Let e > and X an infinite dimensional superreflexive Banach space. Then X has an 
equivalent norm with Banach-Mazur distance of at most 1 + e to the original norm, admitting an infinite 
equilateral set. 

9.2 Generalizations 

An e- almost- equilateral set in a normed space is a set S satisfying 1 — e < ||cc — j/|| < 1+e for all x,y Cz S, 
x^y. Such sets have been studied in [SI El El El El- 

Problem 7. Prove that for each e > there exists S > .such that each n-dimensional normed space has 
an e-almost-equilateral set of size (1 + S)" . 

Equilateral sets may be generalized to the notion of a fc-distance set, i.e., a subset S* of a metric space 
such that {d{x,y) : x,y d S,x j^ y} consists of at most k numbers. Then the Euclidean case is already 
non-trivial, and for normed spaces a few results are known: see |3()| . The following conjecture would 
generalize Petty's theorem: 

Conjecture 3 ([36]). A k-distance set in an n-dimensional normed space has size at most (k + 1)". // 
equality holds for some k > 1, then the .space must be isometric to P^. 

It is known to be true for n = 2 and for £J^ , while for general Minkowski spaces there are some weaker 
estimates; see ^Hj . 
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